Context-Dependent SMT Model using Bilingual Verb-Noun Collocation

نویسندگان

  • Young-Sook Hwang
  • Yutaka Sasaki
چکیده

In this paper, we propose a new contextdependent SMT model that is tightly coupled with a language model. It is designed to decrease the translation ambiguities and efficiently search for an optimal hypothesis by reducing the hypothesis search space. It works through reciprocal incorporation between source and target context: a source word is determined by the context of previous and corresponding target words and the next target word is predicted by the pair consisting of the previous target word and its corresponding source word. In order to alleviate the data sparseness in chunk-based translation, we take a stepwise back-off translation strategy. Moreover, in order to obtain more semantically plausible translation results, we use bilingual verb-noun collocations; these are automatically extracted by using chunk alignment and a monolingual dependency parser. As a case study, we experimented on the language pair of Japanese and Korean. As a result, we could not only reduce the search space but also improve the performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effects of Web-based Concordancing Instruction on EFL Students’ Learning of Verb –Noun Collocations

This study investigates the influence of using five web-based practice units on English verb-noun collocations with the design of a web-based Chinese-English bilingual concordancer (keyword retrieval program) on collocation learning. Thirty-two college EFL students participated by taking a pre-test and two post-tests, and responding to a background questionnaire and an evaluation questionnaire....

متن کامل

Evaluation of Google and Bing online translation of verb-noun collocations from English into Arabic

This article aims to investigate and evaluate the translation of verb-noun collocation in English into Arabic Google and Bing online translation engines. A number of sentences were used as a testing dataset to evaluate both engines. Human translations by three bilingual speakers were used as a gold standard. A simple evaluation metric was proposed to calculate the translation accuracy of verb-n...

متن کامل

Retrieving Bilingual Verb-Noun Collocations by Integrating Cross-Language Category Hierarchies

This paper presents a method of retrieving bilingual collocations of a verb and its objective noun from cross-lingual documents with similar contents. Relevant documents are obtained by integrating crosslanguage hierarchies. The results showed a 15.1% improvement over the baseline nonhierarchy model, and a 6.0% improvement over use of relevant documents retrieved from a single hierarchy. Moreov...

متن کامل

Online Verb-Noun Collocation Instruction with the Support of a Bilingual Concordancer

Studies on language learning have revealed that collocation knowledge is an important indicator for advanced second language competence. Native speakers' production relies heavily on prefabricated or pre-made chunks such as collocations, which save processing load to achieve language fluency and accuracy. Previous investigation on ESL and EFL students indicated that nonnative learners' collocat...

متن کامل

Translating Collocation using Monolingual and Parallel Corpus

In this paper, we propose a method for translating a given verb-noun collocation based on a parallel corpus and an additional monolingual corpus. Our approach involves two models to generate collocation translations. The combination translation model generates combined translations of the collocate and the base word, and filters translations by a target language model from a monolingual corpus,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005